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Controlling Voice Communications O ver A Data Network 

Background 

The invention relates to controlling voice communications over a data network. 
Data networks are widely used to link various types of network elements, such as 
personal computers, servers, gateways, network telephones, and so forth. Data networks may 
include private networks (such a local area networks or wide area networks) and public 
networks (such as the Internet). Popular forms of communications between network 
elements across such data networks include electronic mail, file transfer, web browsing, and 
other exchanges of digital data. 

■ With the increased capacity and reliability of data networks, voice communications 
(including telephone calls, video conferencing, and so forth) over data networks have become 
possible. Voice communications over data networks are unlike voice communications in a 
conventional public switched telephone network (PSTN), which provides users with 
dedicated, end-to-end circuit connections for the duration of each call. Communications over 
data networks, such as IP (Internet Protocol) networks, are performed using packets or 
datagrams that are sent in bursts from a source to one or more destination nodes. Voice data 
sent over a data network typically shares network bandwidth with conventional non-voice 
data (e.g., data associated with electronic mail, file transfer, web access, and other traffic). 

Various standards have been proposed for voice and multimedia communications over 
data networks. One such standard is the H.323 Recommendation from the International 
Telecommunications Union (ITU), which describes terminals, equipment, and services for 
multimedia communications over data networks. 

Another standard for voice and multimedia communications is the Session Initiation 
Protocol (SIP), which establishes, maintains, and terminates multimedia sessions over a data 
network. SEP is part of a multimedia data and control architecture developed by the Internet 
Engineering Task Force (IETF). The IETF multimedia data and control architecture also, 
includes the Resource Reservation Protocol (RSVP) for reserving network resources; the 
Real-Time Transport Protocol (RTP) for transporting real-time data and providing quality of 
service (QoS) feedback; the Real-Time Streaming Protocol (RTSP) for controlling delivery 
of streaming media; the Session Announcement Protocol (SAP) for advertising multimedia 
sessions by multicast; and the Session Description Protocol (SDP) for describing multimedia 
sessions. 
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To perform voice communications over a data network, a typical computer system 
(such as a desktop computer system or a portable computer system) may be equipped with 
voice processing capabilities. Such capabilities include a microphone, ear phones or 
speakers, and speech processing software. Typically, the speech processing software 
5 includes coder/decoders (CODECs) to encode and decode voice data. The voice processing 
software, including the CODECs, may be run on a microprocessor of a typical computer 
system. However, due to the intensive data processing typically required to process voice 
data, speech performance may not be optimum. For example, there may be delays associated 
with the transfer of such voice data due to the amount of time needed to process the voice 

10 data. Also, if certain types of CODECs that have less resource requirements are selected, 
voice quality may suffer. 

Also, the computer system needs to be fitted with speakers, microphones, and sound 
cards to enable speech processing. Further, such speakers, microphones, and sound cards 
may not provide the desired level of quality, or if they do, may be relatively expensive. 

15 Additionally, to add such speech processing components to a computer system may require 
some configuration to be performed by a user, a process that an unsophisticated user may 
have difficulty with. 

Unless a computer system with powerful processing capabilities are provided, the 
voice quality provided by such computer systems are not at the level typically experienced 

20 (and expected) by users of standard telephones. Such "standard" telephones may include 

analog telephones coupled to a local or central switching office or digital telephones coupled 
to a private branch exchange (PBX) system. More recently, network telephones have been 
developed that are capable of being connected directly to a data network, such as an IP 
network. These network telephones are capable of placing telephony calls over a data 

25 network. The voice quality offered by such telephones are typically superior to those that can 
be offered by computer systems, since such network telephones typically include dedicated 
digital signal processors (DSPs) that perform the data intensive calculations involved in 
speech processing. However, the existing network telephones do not provide desired 
multimedia presentation capabilities such as those offered by displays of computer systems. 

30 Thus, while network telephones offer superior speech capabilities, it does have the desired 
multimedia capabilities. On the other hand, computer systems have superior multimedia 
capabilities, but they suffer from relatively poor speech processing performance. 
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A need thus exists for an improved method and apparatus for controlling voice 
communications over data networks. 

Summary 

In general, according to one embodiment, a method of communicating over a data 
network includes communicating, in a control system, one or more control messages over the 
data network to establish a call session with a remote device coupled to the data network. 
One or more commands are transmitted to a voice device coupled to the data network. The 
call session between the voice device and the remote device is established over the data 
network: Information associated with the call session is displayed on the control system. 

In general, according to another embodiment, a method of communicating over a data 
network includes providing a user interface in a control system for establishing call sessions. 
One or more control messages are communicated by the control system over the data network . 
to establish a call session with a remote device in response to receipt of a request through the : 
user interface. One or more commands are transmitted to a voice device associated with a 
control system to establish the call session between the voice device and the remote device 

over the data network. 

Some embodiments of the invention may include one or more of the following 
advantages. The voice processing capabilities of a voice device, such as a network telephone, 
may be advantageously used to provide superior voice quality, while ajghe same time, a 
control system such as a computer may be used to provide a convenient user interface for the 
user to perform call control and to view status and other information relating to the call 
session. Thus, voice quality associated with call sessions over data networks such as packet- 
switched data networks is enhanced using embodiments of the invention. 

Other features and advantages will become apparent from the following description, 
from the drawings, and from the claims. 

Brief Description Of The Drawings 

Fig. 1 is block diagram of an embodiment of a communications system. 

Fig. 2 illustrates components in a network telephone and a call control system in 
accordance with an embodiment. 
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Fig. 3 illustrates control and data paths between network elements used during a call 
session in accordance with one embodiment. 

Figs. 4 and 5 illustrate example screens displayed by the call control system of Fig. 2 
in accordance with an embodiment. 

Fig. 6 is a message flow diagram of messages exchanged between network elements 
in the communications system of Fig. 1 for processing an incoming call. 

Fig. 7 is a message flow diagram of messages exchanged between network elements 
in the communications system of Fig. 1 for placing an outgoing call. 

Detailed Description 

In the following description, numerous details are set forth to provide an 
understanding of the present invention. However, it will be understood by those skilled in the 
art that the present invention may be practiced without these details and that numerous 
variations or modifications from the described embodiments may be possible. For example, 
although reference is made to Session Initiation Protocol (SIP) communications sessions in 
accordance with some embodiments, other protocols may be performed in further 
embodiments. 

Referring to Fig. 1, a communications system 10 includes a first data network 12 and 
a second data network 14 that are coupled through a data network cloud 16. The data 
network cloud 1 6 may include various links, communications paths, and routers for routing 
messages between data networks 12 and 14. The data network cloud 16 may include a public 
network such as the Internet. The data networks 12 and 14 may be private networks such as 
local area networks (LANs) or wide area networks (WANs). In the ensuing discussion, one 
or some combination of the data networks 12 and 14 and data network cloud 16 may be 
referred to collectively as the data network 1 1. As used here, a "data network" or "network" 
may refer to one or more communications networks, channels, links, or paths and systems 
(such as routers) used to route data over such networks, channels, links, or paths. 

The data network 1 1 may include an Internet Protocol (IP) network, which is a 
packet-switched network. One version of EP is described in Request for Comments (RFC) 
791, entitled "Internet Protocol," dated September 1981. Other versions of EP, such as IPv6, 
or other connectionless, packet-switched standards may also be utilized in further 
embodiments. A version of IPv6 is described in RPC 2460, entitled "Internet Protocol, 
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Version 6 (IPv6) Specification," dated December 1998. Packet-switched data networks such 
as IP networks communicate with packets, datagrams or other units of data over the data 
networks. Unlike circuit-switched networks, which provide a dedicated end-to-end 
connection or physical path for the duration of a call session, a packet-switched network is 
one in which the same path may be shared by several network elements. Packet-switched 
networks such as BP networks are based on a connectionless internetwork layer. Packets or 
other units of data injected into a packet-switched data network may travel independently 
over any path (and possibly over different paths) to a destination point. The packets may 
even arrive out of order. Routing of the packets is based on one or more addresses carried in 
each packet. 

The packet-based network 12 may also be connection-oriented, such as an ATM 
(Asynchronous Transfer Mode) network or a Frame Relay network. In a connection- 
oriented, packet-based network, a virtual circuit or connection is established between two end 
points. In such connection-oriented networks, packets are received in the same order in 
which they were transmitted. 

Network elements connected to the data network 1 1 may also be coupled through a 
data network-PSTN gateway 20 to a public-switched telephone network (PSTN) 22. The link 
between the gateway 20 and the PSTN 22 may be a primary rate interface (PRI) link 
according to ISDN (Integrated Services Digital Network). Standard non-data network 
telephones 24 may be coupled to the PSTN 22. Call sessions can thus be established between 
a data network element and one of telephones 84. 

In the example embodiment as illustrated in Fig. 1, audio (e.g., voice) and multimedia 
(e.g., audio and video) communications may occur over the data network 1 1 between or 
among various network elements, including network telephones 30 and 34 and call control 
systems 32 and 36. Other devices capable of voice or multimedia sessions include SIP 
(Session Initiation Protocol) client systems 38 and 40. The SIP client systems 38 and 40 are 
capable of communicating using SIP messaging to establish call sessions. As used here, a 
"call session" refers generally to either a voice or a multimedia session established between 
two or more elements coupled to the data network 1 1 (or any other packet-switched data 
network). SIP is part of the multimedia data and control architecture from the Internet 
Engineering Task Force (IETF). A version of SIP is described in RFC 2543, entitled "SIP: 
Session Initiation Protocol," dated August 1999. SIP may be used to initiate call sessions as 
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well as to invite members to a session that may have been advertised by some other 
mechanism, such as electronic mail, news groups, web pages, and other mechanisms. The 
other protocols in the IETF multimedia and control architecture include the Resource 
Reservation Protocol (RSVP), as described in RFC 2205; the Real-Time Transport Protocol 
5 (RTP), as described in RFC 1889; the Real-Time Streaming Protocol (RTSP), as described in 
RFC 2326; the Session Description Protocol (SDP), as described in RFC 2327; and the 
Session Announcement Protocol (SAP). 

Other standards may be employed in further embodiments for controlling call sessions 
over the data network 1 1 . Such other standards may be any other standard that provides for 

10 interactive, real-time voice communications over the data network. 

The SEP client systems 38 and 40 as shown in Fig. 1 include client application 
programs that are capable of sending SIP requests to perform call requests. The systems 38 
and 40 may also be SIP servers. A server according to SEP may be an application program 
that accepts SIP requests to service calls and to send back responses to SEP requests. Thus, a 

15 system can be either a SIP client or a SIP server. A SIP proxy system, such as system 42, 
may include an intermediary program that acts as both a server and a client for making 
requests on behalf of other clients. 

In the system 10 as shown in Fig. 1, the call control systems 32 and 36 are SEP- 
enabled; that is, the call control systems 32 and 36 are capable of sending and accepting SEP 

20 requests to establish call sessions.. The call control systems 32 and 36 may be implemented 

on a standard computer system platform. Unlike the call control systems 32 and 36, however, 
the network telephones 30 and 34 are not SIP-enabled in one embodiment. Although they are 
capable of communicating audio data over the data network 1 1, the network telephones 30 
and 34 are not enabled to send or accept SEP messages (or other types of messages for 

25 establishing interactive, real-time voice communications) to establish call sessions. In 

accordance with some embodiments, the establishment, management, and termination of call 
sessions are controlled by the call control systems 32 and 36. Thus, the call control system 
32 makes SIP requests on behalf of the network telephone 30, while the call control system 
36 makes SEP requests on behalf of the network telephone 34. Once a call session is 

30 established, the network telephone 30 or 34 participates in the communication of voice data 
over the network 11. 
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By employing the arrangement as shown in Fig. 1, the superior voice capabilities of 
network telephones 30 and 34 may be utilized to provide enhanced voice quality for users 
making telephony calls over the data network 1 1 . At the same time, associated call control 
systems 32 and 36 are used to provide call signaling communications and to provide the user 
with a convenient user interface to perform call control as well as display information 
associated with the call session. 

The call control system 32 and the network telephone 30 may be collectively referred 
to as a telephony system 31 . Similarly, the call control system 36 and network telephone 34 
may be collectively referred to as a telephony system 33. To establish a call session between 
the telephony system 31 or 33 and another SIP-enabled remote system 100, as shown in Fig. 
3, the call control system 32 or 36 sends SIP messages to the remote system 100 to establish a 
call session. The remote system 1 00 may be any system or device on the data network 1 1 
that is capable of participating in a SEP-established call session. The call control system 32 or 
36 also exchanges commands according to a predetermined format with the network 
telephone 30 or 34 to let the network telephone 30 or 34 know of the current status of the call 
setup. Once a call is established, a link may be established between the network telephone 30 
or 34 and the remote system 100 over the data network 11. The link may be a Real-Time 
Protocol (RTP) link to communicate with voice data. Thus, in the telephony system 3 1 or 33, 
the call control system 32 or 36 communicates the control signaling to establish a call 
session, while a real-time link is established directly between the network telephone 30 or 34 
and the remote system 100 for communicating voice or other types of audio data. In one 
embodiment, the call control messaging between the call control system and remote system, 
the control messaging between the call control system and the network telephone, and the call 
session between the network telephone and the remote system all occur over the data network 
11. 

The call control system 32 or 36 is also equipped with speech processing elements to 
allow it to communicate voice data with other devices on the data network 1 1 . Thus, a user at 
the call control system 32 or 36 may select whether to use the call control system or the 
network telephone as the terminal device in the established call session. In addition, if the 
call control system 32 or 36 is powered off, the network telephone 30 or 34 may be used as a 
stand-alone device to communicate voice in call sessions over the data network 11. 
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Referring to Fig. 2, the components in the network telephone 30 or 34 and in the call 
control system 32 or 36 are illustrated in greater detail. The network telephone 30 or 34 
includes a network interface 102 that is coupled to the data network 1 1 . Above the network 
interface 102 are several software layers, including a device driver layer 104, a TCP/IP or 
5 UDP/EP stack 106, and an RTP layer 108. TCP is described in RFC 793, entitled 

"Transmission Control Protocol," dated September 1981; and UDP is described in RFC 768, 
entitled "User Datagram Protocol," dated August 1980. TCP and UDP are transport layers 
for managing connections between network elements over an IP network. Packets received 
by the network interface 102 are passed up through the several layers 104, 106 and 108. 

10 Control packets are transmitted by the TCP/IP or UDP/IP stack 106 to one or more control 
tasks 1 10 in the network telephone 30 or 34. The one or more control tasks 1 10 may be 
implemented as software routines executable on a control unit 112. Instructions and data 
associated with the control tasks 110 may be stored in a storage device 1 14. The control 
taisks 110 are responsible for generation of control signaling as well as exchanging commands 

15 and responses with its associated call control system 32 or 36 over the data network 11. 

Voice data may be passed through the RTP layer 1 08 to a speech processing 
application 1 16, which may also be executable on the control unit 112. For faster processing 
of voice data, a digital signal processor (DSP) 1 18 is included in the network telephone 30 or 
34 to provide data intensive signal processing tasks. For example, the coder/encoder 

20 (CODEC) may be implemented in the DSP 118. The network telephone may also include a 
display screen to display text data associated with a call session. The size of the display 
screen 120 may be limited so that only limited amounts of text data may be displayed in the 
display screen 120. The network telephone also includes numerals buttons that may be 
controlled by button control circuitry 122. The buttons may include numeric buttons, speed 

25 dial buttons, a transfer button, a hold button, a redial button, and other telephony buttons. 
Activation of any one of the buttons may cause generation of some type of an indication 
(such as an interrupt) that is forwarded to the control tasks 110. 

The call control system 32 or 36 also includes a network interface 150. Above the 
network interface 150 are several layers, including a device driver layer 152, a TCP/IP or 

30 UDP/IP stack 154, a SIP stack 156, and an RTP layer 158. The SIP stack 156 is responsible 
for processing or generating SIP requests and responses communicated over the data network 
1 1. The SIP stack 156 is in communication with one or more control tasks 160 in the call 
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control system 32 or 36. The SIP stack 156 is generally a state machine that provides 
parsing, processing, and generation of SIP requests and responses. 

The call control tasks 1 60 are responsible for generating control signaling to establish 
call sessions over the data network 1 1 as well as to respond to received control signaling. In 
addition, the control tasks 160 are responsible for exchanging commands and responses with 
the network telephone 30 or 34 to establish such call sessions. The call control system 32 or 
36 may also include one or more graphical user interface (GUI) routines 162 that control the 
presentation of information (text or graphical) on a display 164 of the call control system. 
Further, the user interface provided by the GUI routines 162 may include selectors for call 
control and indicators of the status of a call session. 

In the illustrated arrangement, the RTP layer 1 58 sends audio data to, or receives 
audio data from, a CODEC 166. The CODEC 166 encodes or decodes voice data. A speech 
processing routine 168 may perform further processing of voice data. In further 
embodiments, the audio CODEC 166 and the speech processing routine 118 may be omitted. 
The various software routines in the call control system 32 or 36, including the various layers 
152, 154, 156, and 158 as well as the control tasks 160, CODECS 166, speech processing 
routine 168, and GUI routine 162, are executable on a control unit 170. The control unit 170 
is coupled to a storage device 172 in which instructions and data associated with the various 
software routines may be stored. 

In the illustrated example arrangement, to provide a voice or audio user interface to a 
user sitting at the call control system 32 or 36, a peripheral controller 174 is coupled to a 
microphone 176 and a speaker or head phone 178 through which a user can talk or listen 
during a call session. If the call control system 32 or 36 is not speech-enabled, the 
microphone 1 76 and speaker or head phone 1 78 may be omitted. 

One call control system 32 or 36 may be associated with a corresponding network 
telephone 30 or 34. Thus, the network telephone 30 or 34 can identify which device is its 
controller. Similarly, a call control system 32 or 36 can identify the network telephone it is 
controlling. The network telephone 30 or 34 includes one or more fields 120 in the storage 
device 1 14 to store an identifier of its call controller, in this case the call control system 32 or 
36. The identifier may be in the form of a network address and port number. For example, 
an IP address and a TCP or UDP port may form part of the identifier of the call controller 
120. Similarly, the call control system 32 or 36 stores one or more fields 180 in the storage 
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device 172 that stores the identifier of the network telephone it is controlling. Again, the 
identifier 1 80 may be in the form of a network address and port number, such as an IP 
address and a TCP or UDP port number. The identifier stored in the field 120 of the network 
telephone may be changed by a user to change the associated call control system. Similarly, 
5 the identifier stored in the field 1 80 of the call control system may be modified to change the 
controlled network telephone. 

In further embodiments, one call control system may be associated with plural 
network telephones. Also, a single network telephone may be associated with plural call 
control systems. 

10 The various control units in the network telephone 30 or 34, the call control 32 or 36, 

and any other system or device on the data network 1 1 may each include a microprocessor, a 
microcontroller, a processor card (including one or more microprocessors or controllers), or 
other control or computing devices. The storage devices referred to in this discussion may 
include one or more machine-readable storage media for storing data and instructions. The 

15 storage media may include different forms of memory including semiconductor memory 
devices such as dynamic or static random access memories (DRAMs or SRAMs), erasable 
and programmable read-only memories (EPROMs), electrically erasable and programmable 
read-only memories (EEPROMs) and flash memories; magnetic disks such as fixed, floppy 
and removable disks; other magnetic media including tape; and optical media such as 

20 compact disks (CDs) or digital video disks (DVDs). Instructions that make up the various 
software routines, modules, or layers in the various network elements may be stored in 
respective storage devices. The instructions when executed by a respective control unit cause 
the corresponding network element to perform programmed acts. 

The instructions of the software routines, modules or layers may be loaded or 

25 transported to the network element in one of many different ways. For example, code 

segments including instructions stored on floppy disks, CD or DVD media, a hard disk, or 
transported through a network interface card, modem, or other interface device may be 
loaded into the system and executed as corresponding software routines, modules, or layers. 
In the loading or transport process, data signals that are embodied in carrier waves 

30 (transmitted over telephone lines, network lines, wireless links, cables, and the like) may 

communicate the code segments, including instructions, to the network element. Such carrier 
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waves may be in the form of electrical, optical, acoustical, electromagnetic, or other types of 
signals. 

Referring to Fig. 4, in accordance with one embodiment, a screen 200 that may be 
provided by the control tasks 160 and graphical user interface routines 162 in the call control 
system 32 or 36 is illustrated. The screen 200 as shown in Fig. 4 includes various icons and 
items (generally referred to as indicators) to allow a user sitting at the call control system to 
initiate, terminate, and screen calls over the data network 11. In the example shown in Fig. 4, 
the screen 200 includes a menu 202, a series of control buttons 204, and a list 206 of potential 
callees. The list 206 provides the first and last names of potential callees as well associated 
electronic mail addresses (or other information such as telephone numbers and so forth). As 
illustrated in Fig. 4, the name R. Smith may be highlighted in the list 206. The address of R. 
Smith is displayed in an address field 208. The address field 208 may include various 
formats, such as a PSTN number (e.g., 972-555-1234); a PSTN number and a proxy address • 
(e.g., 972-555-1234 @ CTEXI300); an IP address (e.g., 47.161.18.72); a SIP address (e.g., . 
rsmith@nortelnetworks.com); or a SIP address at a specific IP address (e.g., 
rsmith@47.161. 18.72). Identifiers according to other formats may be illustrated in the 
address field 208 in further embodiments. : 
A status field 212 may also be included in the screen 200, which may show the status 
as "not in call," "outgoing call to R. Smith," "incoming call from R. Smith," and so forth. A 
plurality of indicators 214 may also be provided in the screen 200. A C indicator flashes 
when an incoming call has been missed. An S indicator gives an indication that call 
screening is active. A P indicator gives an indication that a SIP proxy is in use or not in use. 
An E indicator gives an indication of the state of the associated network telephone. Thus, the 
E indicator is at a first state if the network telephone is not active and at a second state if the 
network telephone is active and available. The E indicator may also be at a third state to 
indicate that a call is currently in progress. 

The screen 200 is also capable of providing a pop-up menu 210 to allow a user to 
select one or several methods of contacting the desired callee. For example, a first option in 
the pop-up menu 210 is to call R. Smith. Another option is to send an electronic mail to R. 
Smith. A third option is to go to R. Smith's web site. 
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Other call control operations that may be performed by a user through the screen 200 
includes volume control, screening of incoming calls, termination of a call session, and other 
operations. 

Referring to Fig. 5, once a call is established with either a caller or a callee, another 
5 screen 300 may be shown. A picture of the caller or callee may be displayed in the screen 
300. An icon 304 may be provided to allow the user to hang-up the call, and another icon 
306 may be provided to allow the call to be placed on hold. A status field 308 indicates the 
current status of the call. 

Referring to Fig. 6, a message flow between a network telephone, a call control 

10 system, and a remote system is illustrated. According to SIP, messages that may be 

exchanged between network elements include requests and responses. The remote system 
may be another call control system, one of the SIP client systems 38 and 40, the data 
network-PSTN gateway 20, or any other system capable of establishing a call session on the 
data network 1 1 . The remote system first sends (at 402) an Invite request (according to SIP) 

15 to the call control system. The Invite request indicates that the receiving node is being 
invited to participate in a session. The message body of the Invite request contains a 
description (e.g., in SDP format) of the session to which the receiving node is being invited. 

The call control system may then send (at 404) a Ringing (SIP) response back to the 
remote system. The Ringing response indicates that the called user agent has located a 

20 possible location where the user has registered recently and is trying to alert the user. The 
call control system may then send (at 406) a ConnectionJReq message to the network 
telephone to initiate a connection between the call control system and the network telephone. 
The messaging format between the network telephone and the call control system may be any 
predetermined format that allows call establishment and control to be performed by the call 

25 control system with the network telephone. One such format is the Unified Networks IP 

Stimulus Protocol, Draft Version 2.1, dated December 7, 1999. In further embodiments, other 
interface protocols may be employed. A description of one embodiment of a protocol for 
message exchange between the network telephone and the call control system is provided in 
U.S. Patent Application Serial No. 09/307,356, entitled "Telephony and Data Network 

30 Services at a Telephone," filed on May 7, 1999, which is hereby incorporated by reference. 

The Connection_Req message is a generic message which includes one or more 
commands that indicates a request to establish a connection. The Connection_Req message 
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may actually include a ring command to activate the ringer of the network telephone and 
other commands to activate the network telephone, such as activation of the handset, headset, 
microphone, speaker, and so forth. The network telephone may then send back (at 408) an 
Ack_Req message to the call control system to acknowledge that the network telephone is 
available and ready. The Ack_Req message may also be a generic message to acknowledge 
receipt of the Connection_Req message. Upon receipt of Ack_Req message from the 
network telephone, the call control system sends (at 410) a 200 OK SIP response to the 
remote system to indicate that the request has succeeded. The remote system then sends (at 
412) an Ack request (according to SIP) to the call control system. The Ack request confirms 
that the client has received a final response to an Invite request. 

Upon receipt of Ack request, the call control system sends (at 414) a Remote_Answer 
message to the network telephone to indicate a request to establish a path for a call session. 
If accepted, the network telephone then sends (at 416) an Ack_Answer message back to the. 
call control system. The Remote_Answer message may be a generic message that includes 
one or more commands to activate the network telephone for call session. One such 
command is a command to open or connect the audio stream to the handset, headset, 
microphone and speaker of the network telephone. At that point, a voice path is established . 
(at 418) directly between the network telephone and the remote system. The voice path may 
be an RTP link over the data network 11. r 

To terminate the call, the remote system may issue (at 420) a Bye request to the call 
control system. The call control system then responds (at 422) with a 200 OK, indicating that 
the call has been terminated. Then, the call control system sends (at 424) a Disconnect_Req 
message to the network telephone to disconnect the network telephone from the data network. 
The Disconnect_Req message be a generic message including one or more commands to 
deactivate various components of the network telephone. For example, the audio stream may 
be closed or disconnected, and the handset, headset, microphone, and speaker may be 
deactivated. The network telephone then returns (at 426) an Ack_Disconnect message back 
to the call control system to indicate that the call has been disconnected. 

Referring to Fig. 7, an outgoing call message flow is illustrated. In the illustrated 
example, the user can initiate the call from the call control system. However, the user can 
also make the external call from the network telephone by entering the desired number in 
appropriate buttons of the network telephone. In that case, messages are exchanged between 
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the network telephone and the call control system initially to indicate to the call control 
system that the user has started a phone call from the network telephone. 

To start the call session, the call control system sends (at 502) an Invite request to the 
remote system. The remote system then sends back (at 504) a Ringing response. In 
5 response, the call control system sends (at 506) a Remote_Alerting message to the network 
telephone indicating that the call has been placed. The network telephone then returns (at 
508) an Ack_Alerting message. At some point, the remote system, once it has answered the 
call, issues (at 510) a 200 OK message to the call control system. In response, the call 
control system then sends (at 512) an Ack request back to the remote system. The call 

10 control system also sends (at 514) a Remote_Answer message to the network telephone, 

which returns (at 516) an Ack_Answer message to the call control system. At that point, a 
voice path (e.g., an RTP path) is established (at 518) between the network telephone and the 
remote system over the data network 11. 

To terminate the call, the remote system may issue (at 520) a Bye request. In 

15 response, the call control system may terminate the call by sending (at 522) a 200 OK 
message. The call control system then sends (at 524) a Disconnect_Req message to the 
network telephone, which returns (at 526) an Ack_Disconnect message to the call control 
system. At this point, the RTP voice path is terminated. 

While the invention has been disclosed with respect to a limited number of 

20 embodiments, those skilled in the art will appreciate numerous modifications and variations 
therefrom. It is intended that the appended claims cover all such modifications and variations 
as fall within the true spirit and scope of the invention. 


WO 01/69899 


PCT/USO 1/07686 


- 15- 

What is claimed is: 

1 1 . A method of communicating over a data network, comprising: 

2 providing a user interface in a control system for establishing call sessions; 

3 communicating, by the control system, one or more control messages over the 

4 data network to establish a call session with a remote device in response to receipt of a 

5 request through the user interface; and 

6 transmitting one or more commands to a voice device connected to the data 

7 network and associated with the control system to establish the call session between the voice 

8 device and the remote device over the data network. 

1 2. The method of claim 1, wherein the communicated one or more control 

2 messages and the transmitted one or more commands are according to different formats. 

1 3. The method of claim 1, wherein transmitting the one or more commands to the 

2 voice device includes transmitting one or more commands to a network telephone including a 

3 network interface to the data network. 

1 4. The method of claim 1 , wherein establishing the call session includes 

2 establishing a Real-Time Protocol session over the data network, 

1 5. The method of claim 1, wherein communicating the one or more control 

2 messages includes communicating messages according to a protocol defining real-time, 

3 interactive call sessions over a packet-switched data network. 

1 6. The method of claim 1, wherein communicating the one or more control 

2 messages includes communicating one or more Session Initiation Protocol messages. 

1 7. The method of claim 1, further comprising storing, in the control system, an 

2 identifier of the voice device. 

1 8. The method of claim 7, wherein storing the identifier includes storing an 

2 Internet Protocol address and a port of the voice device. 
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1 9. The method of claim 1, further comprising receiving an indication from the 

2 . voice device to establish another call session with the remote device. 

1 10. The method of claim 1, further comprising displaying graphical user interface 

2 information of the call session on the control system. 

1 11. The method of claim 1 , further comprising terminating the call session using 

2 either the user interface or the voice device. 

1 12. A method of communicating over a data network, comprising: 

2 in a control system, communicating one or more control messages over the 

3 data network to establish a call session with a remote device coupled to the data network; 

4 transmitting one or more commands to a voice device coupled to the data 

5 network 

6 establishing the call session between the voice device and the remote device 

7 over the data network; and 

8 displaying information associated with the call session on the control system. 

1 13. The method of claim 12, wherein displaying the information includes 

2 displaying graphical user interface information. 

1 14. The method of claim 12, wherein communicating the one or more control 

2 messages includes communicating Session Initiation Protocol messages. 

1 15. The method of claim 12, further comprising providing one or more indicators 

2 for call control in the control system. 

1 16. The method of claim 12, further comprising communicating Real-Time 

2 Protocol messages between the voice device and the remote device over the data network. 

1 17. The method of claim 12, further comprising identifying, in the control system, 

2 an address of the voice device to be controlled by the control system. 
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1 8. The method of claim 12, further comprising providing a user interface on a 
display of the control system, the user interface enabling selection of one or more criteria 
associated with the voice device. 

19. The method of claim 1 8, wherein the one or more criteria includes selection of 
the voice device for use in a voice session established by the control system. 

20. The method of claim 19, wherein the one or more criteria includes an 
identifier of the voice device. 

2 1 . The method of claim 12, further comprising providing voice processing 
components in the control system and selecting one of the voice processing components and 
the voice device to communicate in the established call session. 

22. The method of claim 2 1 , further comprising receiving user selections entered 
in a user interface of the control system to select one of the voice processing components and 
the voice device. 

23 . The method of claim 2 1 , further comprising redirecting selection to the other 
one of the voice processing components and voice device. 

24. The method of claim 1 2, wherein the data network includes a packet-switched 
data network. 

25. A system for controlling a voice device connected to a data network, 
comprising: 

a user interface including one or more selectors for call control relating to call 

sessions; 

a controller adapted to receive a request from the user interface and to 
generate one or more messages for communication over the data network to establish a call 
session with a remote device; and 
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8 an interface to transmit one or more commands relating to the call session to 

9 the voice device to establish a link between the voice device and the remote device over the 
10 data network. 

1 26. The system of claim 25, wherein the one or more messages include Session 

2 Initiation Protocol messages. 

1 27. The system of claim 26, further comprising a module to process the one or 

2 more Session Initiation Protocol messages. 

1 28. The system of claim 25, wherein the interface includes a network interface for 

2 coupling to the data network. 

1 29. The system of claim 25, further comprising a storage element including an 

2 identifier of the voice device. 

1 30. The system of claim 25, wherein the user interface includes one or more 

2 elements to display information relating to the call session. 

1 31 . The system of claim 30, wherein the information includes graphical 

2 information. 

1 32. A computer program capable of running in a system so that the system so 

2 programmed carries out a method for controlling voice communications over a data network, 

3 the method comprising: 

4 providing a user interface in the system to display information associated with 

5 a call session; 

6 communicating one or more control messages over the data network with a 

7 remote device to establish the call session between a voice device and the remote device; and 

8 controlling the voice device during the call session. 
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1 33. The computer program of claim 32, wherein communicating comprises 

2 communicating Session Initiation Protocol messages. 

1 34. The computer program of claim 32, wherein the method further comprises 

2 displaying a picture of a callee. 

1 35. The computer program of claim 32, wherein the method further comprises 

2 displaying icons selectable by a user for call control. 

1 36. A computer program product comprising at least one storage medium having 

2 thereon computer program code means to make a system execute a method for controlling a 

3 call session over a data network, the method comprising: 

4 providing a user interface in the system for establishing the call session; 

5 communicating one or more control messages over the data network to 

6 establish the call session with a remote device in response to a request received through the 

7 user interface; and 

g transmitting one or more commands to a voice device connected to the data 

9 network and associated with the control system to establish the call session between the voice 
10 device and the remote device over the data network. 
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